Display apparatus and method for question and answer

ABSTRACT

A display apparatus and a method for questions and answers are provided. The display apparatus includes a display unit; an input unit configured to receive a user's speech voice; a communication unit configured to perform data communication with an answer server; and a processor configured to create and display one or more question sentences using the speech voice in response to the speech voice being a word speech, create a question language corresponding to the question sentence selected from among the displayed one or more question sentences, transmit the created question language to the answer server via the communication unit, and, in response to one or more answer results related to the question language being received from the answer server, display the received one or more answer results. Accordingly, the display apparatus may provide an answer result appropriate to a user's question intention even when a non-sentence speech is input.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation application of U.S. patent application Ser. No. 16/420,788, filed on May 23, 2019, which is a continuation application of U.S. patent application Ser. No. 14/919,856, filed on Oct. 22, 2015, which claims priority from Korean Patent Application No. 10-2014-0156901, filed on Nov. 12, 2014, in the Korean Intellectual Property Office, the disclosures of which are incorporated herein by reference in their entirety.

BACKGROUND

1. Field

The following description relates to a display apparatus and method for questions and answers, and more particularly, to a display apparatus and method that provide a question and answer service related to non-sentence speech.

2. Description of the Related Art

As display apparatuses have become capable of recognizing a user's voice, they can recognize the user's speech voice to provide a voice recognition result intended by the user or to perform a control command.

In particular, the most common method related to a voice conversation service that provides the voice recognition result intended by users is a keyword recognition method that performs a conversation between users and display apparatuses through a search engine. In such a keyword recognition method, display apparatuses perform a search related to a core keyword based on speech voice related to the core keyword and provide a search result. However, such a method provides numerous search results related to the speech voice for the core keyword, and thus users experience the inconvenience of having to search for desired information among numerous search results.

Another method is a sentence recognition method, where display apparatuses analyze speech voice related to a question sentence of a user to determine a user's speech intention, perform a search based on the determined speech intention, and provide an answer result according to the search.

Such a sentence recognition method is advantageous compared to the keyword recognition method described above in that the sentence recognition method provides an answer result closer to the user's speech intention. However, when a sentence speech appropriate to a preset pattern is not input, or noise occurs in the sentence speech due to an external environment, such a sentence recognition method does not perform correct voice recognition on the sentence speech of a user.

SUMMARY

Exemplary embodiments of the present disclosure overcome the above disadvantages and other disadvantages not described above. Also, the present disclosure is not required to overcome the disadvantages described above, and an exemplary embodiment of the present disclosure may not overcome any of the problems described above.

The present disclosure provides a display apparatus that clearly determines a user's question intention from the user's non-sentence speech voice and provides an answer result according to the determination.

According to an aspect of the present disclosure, a display apparatus including a display unit includes an input unit configured to receive a user's speech voice; a communication unit configured to perform data communication with an answer server; and a processor configured to create and display one or more question sentences using the speech voice if the speech voice is a word speech, create a question language corresponding to the question sentence selected from among the displayed one or more question sentences, transmit the question language to the answer server via the communication unit, and, if one or more answer results related to the question language are received from the answer server, display the received one or more answer results.

The display apparatus may further include: a storage unit configured to store a plurality of sentences and keywords corresponding to characteristic vectors for the plurality of respective sentences, wherein the processor compares similarity in a pronunciation column between the keywords corresponding to the characteristic vectors of the plurality of respective sentences previously stored in the storage unit and the spoken word, determines a sentence including a keyword having a high similarity with the spoken word as the question sentence related to the speech voice, and displays the determined sentence.

If a plurality of sentences are determined as the question sentences, the processor may display the plurality of sentences selected as the question sentences in descending order of the number of times each was previously selected as the question sentence, based on selection history information of each of the plurality of sentences.

If a plurality of sentences are determined as the question sentences, the processor may select and display a preset number of sentences, starting from the sentence having the highest number of times previously selected as the question sentence, from among the plurality of sentences selected as the question sentences, based on selection history information of each of the plurality of sentences.

If keywords related to the spoken word (or utterance word) are received from the answer server, the processor may combine the received keywords, create the question sentence with respect to the speech voice, and display the question sentence.

The answer server may be a triple structure knowledge base server and extract keywords associated with the spoken word using attribute information related to the spoken word.

When there is no sentence including the keyword having the high similarity with the spoken word, the processor may receive the keywords associated with the spoken word from the answer server, combine the received keywords, and create a question sentence related to the speech voice.

If the user's speech voice is a sentence speech, the processor may extract an object name from the user's speech voice using a natural language processing based algorithm and create a question language based on the extracted object name.

According to an aspect of the present disclosure, a method, performed by a display apparatus, of providing questions and answers includes receiving a user's speech voice; if the user's speech voice is a word speech, creating and displaying one or more question sentences using the user's speech voice; creating a question language corresponding to the question sentence selected from among the displayed one or more question sentences and transmitting the question language to an answer server; and receiving and displaying one or more answer results related to the question language from the answer server.

The creating and displaying of the one or more question sentences may include: comparing similarity in a pronunciation column between the keywords corresponding to the characteristic vectors of each of a plurality of previously stored sentences and the spoken word, determining a sentence including a keyword having a high similarity with the spoken word as the question sentence, and displaying the determined sentence.

The creating and displaying of the one or more question sentences may include: if a plurality of sentences are determined as the question sentences, displaying the plurality of sentences selected as the question sentences in descending order of the number of times each was previously selected as the question sentence, based on selection history information of each of the plurality of sentences.

The creating and displaying of the one or more question sentences may include: if a plurality of sentences are determined as the question sentences, selecting and displaying a preset number of sentences, starting from the sentence having the highest number of times previously selected as the question sentence, from among the plurality of sentences selected as the question sentences, based on selection history information of each of the plurality of sentences.

The creating and displaying of the one or more question sentences may include: if keywords related to the spoken word are received from the answer server, combining the received keywords, creating the question sentence with respect to the speech voice, and displaying the created question sentence.

The answer server may be a triple structure knowledge base server and extract keywords associated with the core vocabulary (or core word) using attribute information related to the core vocabulary.

The creating and displaying of the one or more question sentences may include: when there is no sentence including the keyword having the high similarity with the spoken word, receiving the keywords associated with the spoken word from the answer server, combining the received keywords, and creating and displaying a question sentence related to the speech voice.

The method may further include determining whether the user's speech voice is a word speech or a sentence speech, wherein the transmitting includes, if the user's speech voice is a sentence speech, extracting an object name from the user's speech voice using a natural language processing based algorithm, creating a question language based on the extracted object name, and transmitting the created question language to the answer server.

According to an aspect of the present disclosure, a non-transitory computer-readable storage medium is coupled to a display apparatus that provides questions and answers and stores a computer program for executing the following operations: receiving a user's speech voice; if the user's speech voice is a word speech, creating and displaying one or more question sentences using the user's speech voice; creating a question language corresponding to the question sentence selected from among the displayed one or more question sentences and transmitting the question language to an answer server; and receiving and displaying one or more answer results related to the question language from the answer server.

Additional and/or other aspects and advantages of the disclosure will be set forth in part in the description which follows and, in part, will be obvious from the description, or may be learned by practice of the disclosure.

BRIEF DESCRIPTION OF THE DRAWINGS

The above and/or other aspects of the present disclosure will be more apparent by describing certain exemplary embodiments of the present disclosure with reference to the accompanying drawings, in which:

FIG. 1 is a block diagram of a conversation type system according to an exemplary embodiment of the present disclosure;

FIG. 2 is a schematic block diagram of a display apparatus according to an exemplary embodiment of the present disclosure;

FIG. 3 is a detailed block diagram of a display apparatus according to an exemplary embodiment of the present disclosure;

FIG. 4 is an example diagram describing a display apparatus that creates a question sentence according to an exemplary embodiment of the present disclosure;

FIG. 5 is an example diagram describing a display apparatus that displays a question sentence according to an exemplary embodiment of the present disclosure;

FIG. 6 is an example diagram describing a display apparatus that receives keywords related to question sentences through an answer server according to an exemplary embodiment of the present disclosure;

FIG. 7 is an example diagram describing a display apparatus that displays question sentences based on keywords provided by an answer server according to an exemplary embodiment of the present disclosure; and

FIG. 8 is a flowchart of a method of a display apparatus that provides a question and answer service according to an exemplary embodiment of the present disclosure.

DETAILED DESCRIPTION

Reference will now be made in detail to the embodiments, examples of which are illustrated in the accompanying drawings, wherein like reference numerals refer to like elements throughout. The embodiments are described below to explain the present disclosure by referring to the figures.

The exemplary embodiments of the present disclosure may be diversely modified. Accordingly, specific exemplary embodiments are illustrated in the drawings and are described in detail in the detailed description. However, it is to be understood that the present disclosure is not limited to a specific exemplary embodiment, but includes all modifications, equivalents, and substitutions without departing from the scope and spirit of the present disclosure. Also, well-known functions or constructions are not described in detail because they would obscure the disclosure with unnecessary detail.

The terms “first”, “second”, etc. may be used to describe diverse components, but the components are not limited by the terms. The terms are only used to distinguish one component from the others.

The terms used in the present disclosure are only used to describe the exemplary embodiments, but are not intended to limit the scope of the disclosure. The singular expression also includes the plural meaning unless the context clearly indicates otherwise. In the present application, the terms “include” and “consist of” designate the presence of features, numbers, steps, operations, components, elements, or a combination thereof that are written in the specification, but do not exclude the presence or possibility of addition of one or more other features, numbers, steps, operations, components, elements, or a combination thereof.

In the exemplary embodiment of the present disclosure, a “module” or a “unit” performs at least one function or operation, and may be implemented with hardware, software, or a combination of hardware and software. In addition, a plurality of “modules” or a plurality of “units” may be integrated into at least one module except for a “module” or a “unit” which has to be implemented with specific hardware, and may be implemented with at least one processor (not shown).

Hereinafter, the present disclosure will be described in detail with reference to the accompanying drawings.

In the following description, the same drawing reference numerals are used for the same elements even in different drawings. The matters defined in the description, such as detailed construction and elements, are provided to assist in a comprehensive understanding of the disclosure. Thus, it is apparent that the exemplary embodiments of the present disclosure can be carried out without those specifically defined matters. Also, well-known functions or constructions are not described in detail because they would obscure the disclosure with unnecessary detail.

FIG. 1 is a block diagram of a conversation type system according to an exemplary embodiment of the present disclosure.

As illustrated in FIG. 1, the conversation type system includes a display apparatus 100, a voice recognition apparatus 200, and an answer server 300.

The display apparatus 100 is an apparatus that recognizes a user's speech voice to perform an operation intended by the user and may be implemented as various electronic apparatuses such as a smart TV, a smart phone, a tablet PC, etc.

The voice recognition apparatus 200 converts a voice signal of the user's speech voice received from the display apparatus 100 into text and transmits a voice recognition result converted into text to the display apparatus 100. However, the present disclosure is not limited thereto. When data communication between the display apparatus 100 and the voice recognition apparatus 200 is not performed or the speech voice is input via a microphone (not shown), the display apparatus 100 may convert the voice signal of the input user's speech voice into text.

The answer server 300 is a triple structure knowledge base server and provides keywords associated with a word spoken by a user when the text converted from the user's speech voice includes the word. The answer server 300 provides answer information related to a question language received from the display apparatus 100.

In more detail, while the display apparatus 100 operates in a voice recognition mode, the user may perform a question related speech regarding desired information in order to receive the desired information through the display apparatus 100. According to an exemplary embodiment, if the user's speech voice is input via the microphone (not shown), the display apparatus 100 may internally perform voice recognition on the user's speech voice and convert the input user's speech voice into text. If the user's speech voice is input via a remote control apparatus (not shown) such as a remote controller, the display apparatus 100 transmits the voice signal of the user's speech voice to the voice recognition apparatus 200. Accordingly, the voice recognition apparatus 200 converts the voice signal of the speech voice received from the display apparatus 100 into text and transmits the voice recognition result according to the converted text to the display apparatus 100.

If the user's speech voice is converted into text through such a process, the display apparatus 100 analyzes the text converted from the user's speech voice and determines whether the speech voice is a sentence speech or is a speech including one or more words.

As a result of the determination, if the user's speech voice is the speech including one or more words, the display apparatus 100 displays one or more question sentences related to the one or more words. In this regard, the one or more question sentences are sentences predicted as a user's question intention and may be obtained or created through the following exemplary embodiments.

According to an exemplary embodiment, the display apparatus 100 may determine sentences including keywords related to one or more words spoken by the user among a plurality of previously stored sentences as question sentences. According to another exemplary embodiment, the display apparatus 100 may receive keywords associated with a word spoken by the user from the answer server 300, combine the received keywords, and create question sentences related to the speech voice. Operations of determining and creating a question sentence through the exemplary embodiments will be described in detail below.

Thereafter, the display apparatus 100 may display the one or more question sentences predicted as the user's question intention from the user's speech voice. Accordingly, the user may select a question sentence that matches the user's question intention from among the one or more question sentences displayed on the display apparatus 100.

Meanwhile, if the user selects the question sentence or the user's speech voice is the sentence speech, the display apparatus 100 may determine a sentence structure using a natural language processing based algorithm and extract an object name present in the sentence. Thereafter, the display apparatus 100 may create a question language based on the extracted object name. In this regard, the question language is a language recognizable by the answer server 300. According to exemplary embodiments, the question language may be a SPARQL Protocol and RDF Query Language (SPARQL) based language. The SPARQL based language may be a language for querying a document of a semantic web ontology language (RDF (Resource Description Framework), RDFS (Resource Description Framework Schema), OWL (Web Ontology Language)) provided in the W3C (World Wide Web Consortium) standard.

If such a SPARQL based question language is created, the display apparatus 100 transmits the created question language to the answer server 300. The answer server 300 that has received the question language from the display apparatus 100 may obtain one or more answer results based on the received question language and transmit the answer results to the display apparatus 100. Accordingly, the display apparatus 100 may receive the one or more answer results transmitted from the answer server 300 and display the one or more answer results on a screen, and thus the user may be provided with the answer result that is most appropriate to the user's question intention through the one or more answer results displayed on the screen of the display apparatus 100.
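As a rough illustration of creating a SPARQL based question language from an extracted object name, the following minimal sketch builds a query string. The `ex:` prefix, the `http://example.org/schema/` namespace, the `ex:title` property, and the function names are all hypothetical; the disclosure does not specify the schema used by the answer server 300.

```python
def escape_literal(value: str) -> str:
    """Escape backslashes and double quotes for a SPARQL string literal."""
    return value.replace("\\", "\\\\").replace('"', '\\"')


def build_question_language(object_name: str, relation: str) -> str:
    """Build a SPARQL SELECT query asking for `relation` of the named object.

    The prefix and property names are illustrative assumptions only.
    """
    if not relation.isidentifier():
        raise ValueError("relation must be a simple local name")
    return (
        "PREFIX ex: <http://example.org/schema/>\n"
        "SELECT ?answer WHERE {\n"
        f'  ?work ex:title "{escape_literal(object_name)}" .\n'
        f"  ?work ex:{relation} ?answer .\n"
        "}"
    )


# Example: the object name "AAA" and the relation "cast".
query = build_question_language("AAA", "cast")
print(query)
```

The resulting string would then be transmitted to the answer server; the transport is outside the scope of this sketch.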

The operations in which the conversation type system according to the exemplary embodiment provides a question and answer service appropriate to a user's question intention have been described in detail above. Operations in which the display apparatus 100 according to the exemplary embodiment provides a question and answer service appropriate to a user's question intention will be described in detail below.

FIG. 2 is a schematic block diagram of the display apparatus 100 according to an exemplary embodiment of the present disclosure. FIG. 3 is a detailed block diagram of the display apparatus 100 according to an exemplary embodiment of the present disclosure.

As shown in FIG. 2, the display apparatus 100 may include an input unit 110, a display unit 120, a communication unit 130, and a processor 140. In addition, the display apparatus 100 may further include a voice processing unit 150 and a storage unit 160 as shown in FIG. 3.

The input unit 110 is for receiving and transferring various user manipulations to the processor 140 and may be implemented as an input panel. In this regard, the input panel may include a touch pad, a key pad including various function keys, number keys, special keys, character keys, etc., or a touch screen. In particular, the input unit 110 may receive a user's speech voice via a microphone (not shown).

The display unit 120 may display question sentences related to the user's speech voice and answer results of the user's speech voice. The display unit 120 may be implemented as a liquid crystal display (LCD), an organic light emitting diode (OLED) display, a plasma display panel (PDP), etc. In addition, the display unit 120 may be implemented as a touch panel and receive a user touch instruction.

The communication unit 130 may perform data communication with the answer server 300 and receive an answer result related to the user's speech voice. The communication unit 130 may perform data communication with a remote control apparatus (not shown) such as a remote controller and receive a voice signal of the user's speech voice. The communication unit 130 may include various communication modules such as a near distance wireless communication module (not shown), a wireless communication module (not shown), etc.

In this regard, the near distance wireless communication module (not shown) is a communication module that performs wireless communication with the voice recognition apparatus 200 located at a near distance and an external server (not shown) providing content, and may be, for example, Bluetooth, Zigbee, etc. The wireless communication module (not shown) is a module connected to an external network according to a wireless communication protocol such as WiFi, IEEE, etc. and performing communication. In addition, the wireless communication module may further include a mobile communication module that accesses a mobile communication network and performs communication according to various mobile communication standards such as 3rd generation (3G), 3rd Generation Partnership Project (3GPP), long term evolution (LTE), etc.

The processor 140 is for controlling an apparatus, may be referred to interchangeably as a central processing unit, a microprocessor, a controller, etc., and is used to control general operations of the apparatus. The processor 140 may be coupled to a different function part such as the voice processing unit 150, the communication unit 130, etc. and implemented as a system-on-a-chip (SOC) or a system on chip (SoC).

If the speech voice is word text, the processor 140 may control the display unit 120 to display one or more question sentences related to one or more words. Thereafter, the processor 140 may analyze a question sentence selected by a user from among the displayed one or more question sentences, create a question language corresponding to the question sentence, and control the communication unit 130 to transmit the created question language to the answer server 300. Thereafter, if one or more answer results related to the question language are received from the answer server 300, the display unit 120 may display the one or more answer results received from the answer server 300 according to a control instruction of the processor 140.

In more detail, the processor 140 may obtain text of the user's speech voice based on the following exemplary embodiments.

According to an exemplary embodiment, the processor 140 may determine a subject that is to perform voice recognition on the user's speech voice according to whether the user's speech voice is input to the input unit 110 via the microphone (not shown) or via the remote control apparatus (not shown).

If the user's speech voice is input to the input unit 110, the processor 140 may control the voice processing unit 150 to perform voice recognition on the user's speech voice. According to such a control instruction, the voice processing unit 150 may convert the user's speech voice into text using a speech to text (STT) algorithm.

Meanwhile, if the voice signal of the user's speech voice is received through the communication unit 130, the processor 140 may transmit the voice signal of the user's speech voice to the voice recognition apparatus 200. Accordingly, the voice recognition apparatus 200 may perform voice recognition on the received voice signal of the user's speech voice, convert the voice signal of the user's speech voice into text, and transmit a voice recognition result to the display apparatus 100.

According to an exemplary embodiment, the processor 140 may determine the subject that is to perform voice recognition on the user's speech voice according to whether data communication with the voice recognition apparatus 200 can be performed through the communication unit 130. If the processor 140 is able to perform data communication with the voice recognition apparatus 200, the processor 140 may receive the voice recognition result regarding the user's speech voice from the voice recognition apparatus 200. If the processor 140 is unable to perform data communication with the voice recognition apparatus 200, the processor 140 may perform voice recognition on the user's speech voice through the voice processing unit 150.
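The two ways of choosing the subject of voice recognition described above can be sketched as follows. This is only an illustrative sketch: the enum, the function names, and the string labels for the input path are assumptions and do not appear in the disclosure.

```python
from enum import Enum


class Recognizer(Enum):
    LOCAL = "voice processing unit 150"
    REMOTE = "voice recognition apparatus 200"


def choose_by_input_path(input_path: str) -> Recognizer:
    """Pick the recognizer from how the speech voice arrived.

    "microphone" and "remote_control" are hypothetical labels.
    """
    if input_path == "microphone":
        return Recognizer.LOCAL
    if input_path == "remote_control":
        return Recognizer.REMOTE
    raise ValueError(f"unknown input path: {input_path}")


def choose_by_connectivity(server_reachable: bool) -> Recognizer:
    """Prefer the remote apparatus when reachable; otherwise recognize locally."""
    return Recognizer.REMOTE if server_reachable else Recognizer.LOCAL


print(choose_by_input_path("microphone").value)
print(choose_by_connectivity(False).value)
```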

If text of the user's speech voice is created through the above exemplary embodiment, the processor 140 may analyze the text created from the user's speech voice and determine whether a user speech is a sentence speech or a word speech. According to an exemplary embodiment, the processor 140 may analyze a sentence structure of the text of the speech voice through a morpheme analysis and a syntax analysis that are natural language processing technologies and determine whether the user speech is the sentence speech or the word speech. Such a natural language processing technology is well known, and thus a detailed description thereof is omitted in the present disclosure.
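The distinction between a word speech and a sentence speech can be illustrated with a deliberately simplified sketch. It does not perform morpheme or syntax analysis; instead it substitutes a tiny hypothetical verb lexicon, treating text as a sentence speech only when it contains a predicate and at least three tokens. The lexicon and the rule are assumptions made for illustration.

```python
# Hypothetical mini-lexicon standing in for a real morpheme/syntax analyzer.
VERBS = {"is", "are", "was", "were", "do", "does", "did",
         "show", "tell", "find", "play", "starred", "directed"}


def classify_speech(text: str) -> str:
    """Return "sentence" or "word" for the recognized text."""
    tokens = [t.strip("?.!,").lower() for t in text.split()]
    tokens = [t for t in tokens if t]
    has_predicate = any(t in VERBS for t in tokens)
    # Simplified rule: a sentence needs a predicate and at least three tokens.
    if has_predicate and len(tokens) >= 3:
        return "sentence"
    return "word"


print(classify_speech("AAA movie cast"))
print(classify_speech("Who is the cast of the movie AAA?"))
```

A production system would replace the lexicon lookup with proper part-of-speech tagging and parsing.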

If the text analysis of the speech voice determines that the user speech is a speech including one or more words, the processor 140 may determine a question sentence predicted as a user's intention and display the determined question sentence on the display unit 120 according to the following exemplary embodiments.

According to an exemplary embodiment, the processor 140 may determine one or more sentences including keywords related to one or more words spoken by the user among a plurality of previously stored sentences as the question sentences related to the user's speech voice. In more detail, the storage unit 160 may store the plurality of sentences and keywords corresponding to characteristic vectors for the plurality of respective sentences. Thus, the processor 140 may compare similarity in a pronunciation column between the keywords corresponding to the characteristic vectors of the plurality of sentences previously stored in the storage unit 160 and the one or more words. Thereafter, the processor 140 may determine a sentence including a keyword having a high similarity with the one or more words spoken by the user as the question sentence related to the speech voice.

According to an exemplary embodiment, the processor 140 may transmit the one or more words spoken by the user to the answer server 300 and, upon receiving keywords associated with the one or more words from the answer server 300, combine the received keywords and create the question sentence with respect to the user's speech voice. In this regard, the answer server 300 is a triple structure knowledge base server and may perform an operation of extracting keywords associated with a word spoken by the user using attribute information related to the word and transmitting the extracted keywords to the display apparatus 100.

According to an exemplary embodiment, when there is no sentence including the keywords related to the one or more words spoken by the user among the plurality of previously stored sentences, the processor 140 may transmit the one or more words to the answer server 300. Thus, the answer server 300 may perform an operation of extracting keywords associated with the one or more words received from the display apparatus 100 using attribute information related to the one or more words and transmitting the keywords to the display apparatus 100.
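As a sketch of how a triple structure knowledge base could yield keywords for a spoken word, and how those keywords could be combined into question sentences, consider the following. The sample triples, the choice of attribute names as keywords, and the sentence template are hypothetical; the disclosure does not define the server's data or the combination rule.

```python
# Hypothetical (subject, attribute, value) triples held by the answer server.
TRIPLES = [
    ("AAA", "type", "movie"),
    ("AAA", "genre", "drama"),
    ("AAA", "cast", "Actor Y"),
    ("BBB", "type", "movie"),
]


def related_keywords(word, triples):
    """Collect the attribute names of every triple whose subject is `word`."""
    keywords = []
    for subject, attribute, _value in triples:
        if subject == word and attribute != "type" and attribute not in keywords:
            keywords.append(attribute)
    return keywords


def combine_into_questions(word, keywords):
    """Combine each keyword with the spoken word using an assumed template."""
    return [f"What is the {keyword} of {word}?" for keyword in keywords]


keywords = related_keywords("AAA", TRIPLES)
for sentence in combine_into_questions("AAA", keywords):
    print(sentence)
```

In the described system the keyword extraction would run on the answer server 300 and the combination on the display apparatus 100; both are shown in one process here for brevity.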

The answer server 300, which provides the keywords related to the one or more words received from the display apparatus 100 as described above, may also, upon receiving the question language related to the question sentence, perform an operation of transmitting an answer result of the question sentence based on the received question language to the display apparatus 100. In this regard, the question language may be a SPARQL Protocol and RDF Query Language (SPARQL) based language recognizable by the answer server 300. The answer server 300 that provides the keywords for creating the question sentence and provides the answer result with respect to the question sentence based on the question language in the SPARQL based language will be described in detail below.

FIG. 4 is an example diagram describing a display apparatus that creates a question sentence according to an exemplary embodiment of the present disclosure.

As shown in FIG. 4, the storage unit 160 may match and store a plurality of sentences and keywords corresponding to characteristic vectors for the plurality of respective sentences. Meanwhile, words extracted from the user's speech voice may include “AAA”, “movie”, and “cast”. If such words are extracted, the processor 140 may measure similarity between the keywords corresponding to the characteristic vectors of the plurality of sentences previously stored in the storage unit 160 and the words extracted from the user's speech voice.

In more detail, the processor 140 may generate a phonetic symbol of each of “AAA”, “movie”, and “cast” that are the words extracted from the user's speech voice. Thereafter, the processor 140 may measure similarity between phonetic symbols of the keywords corresponding to the characteristic vectors of the plurality of respective sentences previously stored in the storage unit 160 and phonetic symbols of each of “AAA”, “movie”, and “cast” using a similarity algorithm such as a confusion matrix. Thereafter, the processor 140 may extract a sentence including a keyword measured to have a similarity equal to or greater than a predetermined threshold value related to “AAA”, “movie”, and “cast” from among the plurality of previously stored sentences based on a similarity measurement result.

As shown in FIG. 4, the storage unit 160 may store first through sixth sentences 410, 420, 430, 440, 450, and 460 having high similarity related to at least one of “AAA”, “movie”, and “cast”. If the first through sixth sentences 410 through 460 related to at least one of “AAA”, “movie”, and “cast” are stored, the processor 140 may extract a sentence having high similarity with respect to each of “AAA”, “movie”, and “cast” from the first through sixth sentences 410 through 460. As shown in FIG. 4, a first keyword for “AAB” among the keywords matching the sixth sentence 460 may have low similarity with “AAA”. In this case, the processor 140 may determine the first through fifth sentences 410 through 450, excluding the sixth sentence 460, as sentences having high similarity with “AAA”, “movie”, and “cast”, and determine the first through fifth sentences 410 through 450 as question sentences predictable as the user's speech intention.

FIG. 5 is an example diagram describing a display apparatus that displays a question sentence according to an exemplary embodiment of the present disclosure.

As described with reference to FIG. 4 above, the first through fifth sentences 410 through 450 may be determined as question sentences predictable as the user's speech intention in connection with a word 510 for each of “AAA”, “movie”, and “cast”. If the plurality of question sentences are determined in connection with the word 510 for each of “AAA”, “movie”, and “cast”, the processor 140 may control the display unit 120 to display the first through fifth sentences 410 through 450 in descending order of the number of times each sentence was previously selected as the question sentence, based on selection history information of each of the first through fifth sentences 410 through 450. For example, among the first through fifth sentences 410 through 450, the sentences most frequently selected by the user as question sentences may be, in order, the fourth sentence 440, the first sentence 410, the third sentence 430, the second sentence 420, and the fifth sentence 450.

In this case, the display unit 120 may display, on a screen, a question sentence list 520 in which the fourth sentence 440, the first sentence 410, the third sentence 430, the second sentence 420, and the fifth sentence 450 are sequentially listed.

Meanwhile, the processor 140 may control the display unit 120 to display only a predetermined number of the first through fifth sentences 410 through 450 selected as the question sentences. For example, if 3 is set as the number of question sentences to be included in the question sentence list 520, the processor 140 may determine, as the question sentences, the top three sentences in descending order of the number of times previously selected as the question sentence among the first through fifth sentences 410 through 450. Thus, the display unit 120 may display, on the screen, the question sentence list 520 including the fourth sentence 440, the first sentence 410, and the third sentence 430, which have been selected as the question sentence the greatest number of times among the first through fifth sentences 410 through 450.

Therefore, the user may select an intended question sentence from among the question sentences listed on the question sentence list 520 displayed on the screen.
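The ordering by selection history and the optional limit on the number of listed sentences may be sketched as follows. The selection counts and the function name `rank_question_sentences` are hypothetical; only the resulting order (440, 410, 430, 420, 450) is taken from the description of FIG. 5.

```python
from typing import Dict, List, Optional


def rank_question_sentences(
    selection_counts: Dict[str, int], limit: Optional[int] = None
) -> List[str]:
    """Order sentences by how often each was previously selected, most first.

    If limit is given, only that many top-ranked sentences are returned.
    """
    ranked = sorted(selection_counts, key=lambda s: selection_counts[s], reverse=True)
    return ranked if limit is None else ranked[:limit]


# Hypothetical selection history for the first through fifth sentences.
history = {"410": 7, "420": 3, "430": 5, "440": 9, "450": 1}
full_list = rank_question_sentences(history)
top_three = rank_question_sentences(history, limit=3)
```

With the assumed counts, `full_list` reproduces the order of the question sentence list 520, and `top_three` corresponds to the case in which 3 is set as the number of question sentences to be listed.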

FIG. 6 is an example diagram describing the display apparatus 100 that receives keywords related to question sentences through the answer server 300 according to an exemplary embodiment of the present disclosure. FIG. 7 is an example diagram describing the display apparatus 100 that displays question sentences based on keywords provided by the answer server 300 according to an exemplary embodiment of the present disclosure.

As shown in FIG. 6, the user's speech voice may include a vocabulary column including a first word 610 “AAA” and a second word 620 “movie”. In this case, the processor 140 may transmit text of the first and second words 610 and 620 through the communication unit 130 to the answer server 300, which is a triple structure knowledge base server. If the text of the first and second words 610 and 620 is received, the answer server 300 may extract keywords associated with the first and second words 610 and 620 using attribute information of the first and second words 610 and 620.

In this regard, the answer server 300 is the triple structure knowledge base server and may extract keywords associated with each word using attribute information related to each word. In more detail, the answer server 300 may obtain higher attribute information a of “movie” from the first and second words 610 and 620. If the higher attribute information a of “movie” is obtained from the first and second words 610 and 620, the answer server 300 may obtain lower attribute information b of each of “actor”, “director”, and “OST” associated with the higher attribute information a. Thus, the answer server 300 may determine the higher attribute information a of “movie” associated with the first and second words 610 and 620 and the lower attribute information b of each of “actor”, “director”, and “OST” as keywords associated with the first and second words 610 and 620, and transmit a keyword search result to the display apparatus 100.
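The keyword extraction from higher and lower attribute information may be sketched with a small in-memory set of triples. The triples, the predicate names `has_attribute` and `is_a`, and the function `related_keywords` are assumptions made for illustration; the disclosure does not specify how the knowledge base of the answer server 300 is organized internally.

```python
from typing import List, Tuple

Triple = Tuple[str, str, str]  # (subject, predicate, object)

# Hypothetical contents of a triple structure knowledge base.
TRIPLES: List[Triple] = [
    ("movie", "has_attribute", "actor"),
    ("movie", "has_attribute", "director"),
    ("movie", "has_attribute", "OST"),
    ("AAA", "is_a", "actor"),
]


def related_keywords(words: List[str]) -> List[str]:
    """Return each higher attribute found among the words, then its lower attributes."""
    keywords: List[str] = []
    for word in words:
        lower = [o for s, p, o in TRIPLES if s == word and p == "has_attribute"]
        if lower:  # the word names a higher attribute such as "movie"
            keywords.append(word)
            keywords.extend(lower)
    return keywords


keyword_search_result = related_keywords(["AAA", "movie"])
```

In this sketch, the words “AAA” and “movie” yield the higher attribute “movie” followed by the lower attributes “actor”, “director”, and “OST”.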

If such a keyword search result is received, the processor 140 may create question sentences, based on a predefined standard sentence table, by combining the vocabulary column 710 including the first and second words 610 and 620 with the keywords 720 (actor, title, director, and OST) included in the keyword search result received from the answer server 300.

As shown in FIG. 7, the display unit 120 may display, on a screen, a question sentence list 730 including the plurality of question sentences created based on the vocabulary column 710 including the first and second words 610 and 620 and the keywords 720 (actor, title, director, and OST) included in the keyword search result received from the answer server 300. That is, the display unit 120 may display, on the screen, the question sentence list 730 including a first question sentence “What is a title of a movie AAA appears in?”, a second question sentence “Which actress does appear in the movie together with AAA?”, a third question sentence “What is an OST of the movie AAA appears in?”, and a fourth question sentence “Who is a director of the movie AAA appears in?”. Thus, the user may select an intended question sentence from among the question sentences listed on the question sentence list 730 displayed on the screen.
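The combination of spoken words and received keywords through a standard sentence table may be sketched as a lookup of sentence templates. The table layout, the `{name}` placeholder, and the function `create_question_sentences` are assumptions; the sentence texts themselves are the four question sentences of the question sentence list 730.

```python
from typing import Dict, List

# Hypothetical standard sentence table: one template per keyword.
STANDARD_SENTENCES: Dict[str, str] = {
    "title": "What is a title of a movie {name} appears in?",
    "actor": "Which actress does appear in the movie together with {name}?",
    "OST": "What is an OST of the movie {name} appears in?",
    "director": "Who is a director of the movie {name} appears in?",
}


def create_question_sentences(name: str, keywords: List[str]) -> List[str]:
    """Fill the template of every keyword that has an entry in the table."""
    return [
        STANDARD_SENTENCES[k].format(name=name)
        for k in keywords
        if k in STANDARD_SENTENCES
    ]


question_list = create_question_sentences("AAA", ["title", "actor", "OST", "director"])
```

Keywords without a template are simply skipped in this sketch, which is one possible design choice rather than a behavior stated in the disclosure.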

If one question sentence is selected through the various exemplary embodiments or if, as described above, the user's speech is a sentence speech, the processor 140 may analyze the selected question sentence or the user's speech sentence that is converted into text (hereinafter referred to as the question sentence) and create a question language. In more detail, the processor 140 may determine a sentence structure of the question sentence selected or spoken by the user using a natural language processing based algorithm and extract an object name present in the question sentence. Thereafter, the processor 140 may determine a relationship with knowledge desired by the user based on the extracted object name.

According to an exemplary embodiment, the processor 140 may determine the relationship with the knowledge desired by the user through a lexical answer type (LAT) or focus method. In this regard, the LAT method is a method of performing a question intention analysis by classifying question sentences according to their forms, and the focus method is a method of finding attributes of answers present in question sentences. The LAT and focus methods are well known, and thus detailed descriptions thereof are omitted in the present disclosure.

If keywords corresponding to characteristic vectors are extracted from the question sentence selected or spoken by the user through such various methods, the processor 140 may convert the keywords extracted from the question sentence into a SPARQL language recognizable by the answer server 300 and transmit a question language converted into the SPARQL language to the answer server 300. Accordingly, the answer server 300 may generate or extract an answer result with respect to the user's question sentence based on the question language received from the display apparatus 100 and transmit the answer result to the display apparatus 100.

For example, the question sentence selected or spoken by the user may be “What is a title of a movie AAA appears in?”, and keywords corresponding to characteristic vectors extracted from the question sentence may be “AAA”, “movie”, and “title”. In this case, as shown in FIG. 7, the answer server 300 may extract a first object c “movie_01” and a second object c “movie_02” based on “AAA”, “movie”, and “title” and transmit an answer result to the display apparatus 100. Thus, the display unit 120 may display, on the screen, answer results with respect to the first object c “movie_01” and the second object c “movie_02” related to the question sentence “What is a title of a movie AAA appears in?” based on the answer result received from the answer server 300.
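The conversion of the keywords “AAA”, “movie”, and “title” into a SPARQL based question language may be sketched as simple query construction. The `ex:` prefix, the `http://example.org/` namespace, the property names `ex:actor` and `ex:name`, the class `ex:Movie`, and the function `build_question_language` are hypothetical; the actual vocabulary would depend on the ontology used by the answer server 300.

```python
def escape_literal(value: str) -> str:
    """Escape backslashes and double quotes for a SPARQL string literal."""
    return value.replace("\\", "\\\\").replace('"', '\\"')


def build_question_language(person: str, target: str) -> str:
    """Build a SPARQL SELECT query asking for `target` of movies `person` appears in."""
    if not target.isidentifier():
        raise ValueError("target must be a simple property name")
    return (
        "PREFIX ex: <http://example.org/>\n"
        f"SELECT ?{target} WHERE {{\n"
        "  ?movie a ex:Movie ;\n"
        "         ex:actor ?person ;\n"
        f"         ex:{target} ?{target} .\n"
        f'  ?person ex:name "{escape_literal(person)}" .\n'
        "}"
    )


question_language = build_question_language("AAA", "title")
```

Against a knowledge base using this assumed vocabulary, such a query would bind `?title` once for each movie in which “AAA” appears, which corresponds to the answer results for “movie_01” and “movie_02” in the example above.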

A method in which the display apparatus 100 provides a question and answer service based on the user's speech voice will be described in detail below.

FIG. 8 is a flowchart of a method of the display apparatus 100 that provides a question and answer service according to an exemplary embodiment of the present disclosure.

As shown in FIG. 8, if the user's speech voice is input in a voice recognition mode, the display apparatus 100 may determine whether text of the input speech voice includes one or more words (operations S810 and S820). According to an exemplary embodiment, if the user's speech voice is input through a microphone (not shown), the display apparatus 100 may convert the user's speech voice into text using a speech to text (STT) algorithm. Meanwhile, if a voice signal of the user's speech voice is input through a remote control apparatus (not shown), the display apparatus 100 may transmit the voice signal of the user's speech voice to the voice recognition apparatus 200. Accordingly, the voice recognition apparatus 200 may perform voice recognition on the voice signal of the speech voice, convert the speech voice into text, and transmit a voice recognition result to the display apparatus 100.

If the text is created with respect to the user's speech voice through the exemplary embodiment above, the display apparatus 100 may analyze the text created from the user's speech voice and determine whether the user speech is a sentence speech or a word speech. According to an exemplary embodiment, the display apparatus 100 may analyze a sentence structure of the text of the speech voice through a morpheme analysis and a syntax analysis, which are natural language processing technologies, and determine whether the user speech is the sentence speech or the word speech.

If the text analysis of the speech voice determines that the user speech is a speech including one or more words, the display apparatus 100 may display one or more question sentences related to the one or more words (operation S830).

According to an exemplary embodiment, the display apparatus 100 may determine one or more sentences including keywords related to the one or more words, among a plurality of previously stored sentences, as the question sentences related to the user's speech voice. In more detail, the display apparatus 100 may compare similarity in a pronunciation column between keywords corresponding to characteristic vectors of the plurality of previously stored sentences and the one or more words. Thereafter, the display apparatus 100 may determine and display a sentence including a keyword having a high similarity with the one or more words as the question sentence related to the speech voice.

Meanwhile, if a plurality of sentences are determined as the question sentences, the display apparatus 100 may display the plurality of sentences selected as the question sentences in descending order of the number of times each was previously selected as the question sentence, based on selection history information of each of the plurality of sentences.

Meanwhile, if the number of sentences determined as the question sentences exceeds a predetermined number, the display apparatus 100 may select and display, as the question sentences, only the predetermined number of sentences, starting from the sentence previously selected as the question sentence the greatest number of times among the plurality of sentences.

According to an exemplary embodiment, the display apparatus 100 may transmit the one or more words to the answer server 300 and, if keywords related to the one or more words are received from the answer server 300, combine the received keywords to create the question sentence with respect to the user's speech voice. In this regard, the answer server 300 is a triple structure knowledge base server and may perform an operation of extracting keywords associated with a word spoken by the user using attribute information related to the word and transmitting the extracted keywords to the display apparatus 100.

According to an exemplary embodiment, when there is no sentence including the keywords related to the one or more words among the plurality of previously stored sentences, the display apparatus 100 may transmit the one or more words to the answer server 300. Thus, the answer server 300 may perform an operation of extracting keywords associated with the one or more words received from the display apparatus 100 using attribute information of the one or more words and transmitting the keywords to the display apparatus 100. Through such various exemplary embodiments, the display apparatus 100 may display the one or more question sentences related to the user's speech voice.

Accordingly, the user may select an intended question sentence from among the question sentences displayed on the display apparatus 100. As described above, if one question sentence is selected by the user or if it is determined in operation S820 that the user's speech is the sentence speech, the display apparatus 100 may analyze the selected question sentence or the user's speech sentence that is converted into text (hereinafter referred to as the question sentence), create a question language, and transmit the question language to the answer server 300 (operation S840). Accordingly, the answer server 300 may obtain one or more answer results related to information desired by the user based on the received question language and transmit the one or more answer results to the display apparatus 100. The display apparatus 100 may then display the one or more answer results received from the answer server 300 (operation S850).
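The overall flow of operations S810 through S850 may be summarized in the following sketch. Every callable passed to `answer_flow` is a hypothetical stand-in supplied by the caller (sentence detection, question sentence suggestion, user selection, question language creation, and the exchange with the answer server 300); none of them is defined by the disclosure.

```python
from typing import Callable, List


def answer_flow(
    text: str,
    is_sentence: Callable[[str], bool],
    suggest: Callable[[List[str]], List[str]],
    choose: Callable[[List[str]], str],
    to_query: Callable[[str], str],
    ask: Callable[[str], List[str]],
) -> List[str]:
    """Return answer results for recognized speech text (S810 to S850)."""
    words = text.split()
    if not words:                      # S820: no words were recognized
        return []
    if is_sentence(text):              # sentence speech: use the text directly
        question = text
    else:                              # word speech: S830, display and select
        candidates = suggest(words)
        if not candidates:
            return []
        question = choose(candidates)
    return ask(to_query(question))     # S840: transmit, S850: results to display


# Usage with trivial stand-ins for each step.
results = answer_flow(
    "AAA movie",
    is_sentence=lambda t: t.endswith("?"),
    suggest=lambda ws: ["What is a title of a movie AAA appears in?"],
    choose=lambda cs: cs[0],
    to_query=lambda q: "QUERY(" + q + ")",
    ask=lambda query: ["movie_01", "movie_02"],
)
```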

In more detail, the display apparatus 100 may determine a sentence structure of the question sentence selected or spoken by the user using a natural language processing based algorithm and extract an object name present in the question sentence. Thereafter, the display apparatus 100 may determine a relationship with knowledge desired by the user based on the extracted object name. According to an exemplary embodiment, the display apparatus 100 may determine the relationship with the knowledge desired by the user through a lexical answer type (LAT) or focus method.

If keywords corresponding to characteristic vectors are extracted from the question sentence selected or spoken by the user through such various methods, the display apparatus 100 may convert the keywords extracted from the question sentence into a SPARQL language recognizable by the answer server 300 and transmit a question language converted into the SPARQL language to the answer server 300. In this regard, the SPARQL based language may be a language for querying a document of a semantic web ontology language (RDF (Resource Description Framework), RDFS (Resource Description Framework Schema), or OWL (Web Ontology Language)) provided in the W3C (World Wide Web Consortium) standard.

Meanwhile, if the question language converted into the SPARQL language is received, the answer server 300 may generate or extract an answer result with respect to the user's question sentence based on the received question language and transmit the answer result to the display apparatus 100.

As described above, according to one or more exemplary embodiments of the present disclosure, a display apparatus may provide an answer result appropriate to a user's question intention even when a non-sentence speech is input.

The method of the display apparatus 100 that provides the question and answer service as described above may be implemented as at least one program to perform the method of providing the question and answer service described above. The program may be stored in a non-transitory computer readable medium.

The non-transitory computer readable medium is not a medium that stores data temporarily, such as a register, a cache, or a memory, but a medium that stores data semi-permanently and is readable by devices. More specifically, the aforementioned programs may be stored in non-transitory computer readable media such as compact disks (CDs), digital video disks (DVDs), hard disks, Blu-ray disks, universal serial bus (USB) memories, memory cards, and read-only memory (ROM).

The foregoing exemplary embodiments and advantages are merely exemplary and are not to be construed as limiting the present disclosure. The present teaching can be readily applied to other types of apparatuses. Also, the description of the exemplary embodiments of the present disclosure is intended to be illustrative, and not to limit the scope of the claims, and many alternatives, modifications, and variations will be apparent to those skilled in the art.

Although a few embodiments have been shown and described, it would be appreciated by those skilled in the art that changes may be made in these embodiments without departing from the principles and spirit of the disclosure, the scope of which is defined in the claims and their equivalents.

What is claimed is:
 1. An electronic apparatus, comprising: a display; a communicator; an input receiver configured to receive a user input; a voice receiver configured to receive a voice input; and a processor configured to: based on a plurality of words being included in text information corresponding to a first voice input, receive a plurality of suggested combination texts corresponding to a combination of words of the plurality of words through the communicator from a server, control the display to display the received plurality of suggested combination texts, based on a second voice input or the user input being received for selecting a suggested combination text of the plurality of suggested combination texts, receive information corresponding to the selected suggested combination text through the communicator from the server, and control the display to display the received information corresponding to the selected suggested combination text.
 2. The electronic apparatus as claimed in claim 1, wherein the processor is further configured to control the display to display the plurality of suggested combination texts in order of a number of times in which each suggested combination text of the plurality of suggested combination texts has been previously selected based on selection history information.
 3. The electronic apparatus as claimed in claim 1, wherein the processor is further configured to select and display a predetermined number of suggested combination texts from among the plurality of suggested combination texts based on a number of times in which each suggested combination text of the plurality of suggested combination texts has been previously selected based on selection history information.
 4. The electronic apparatus as claimed in claim 1, further comprising: a storage configured to store a plurality of sentences, and keywords corresponding to characteristic vectors for the plurality of sentences, respectively, wherein the processor is configured to compare a similarity in a pronunciation column between the stored keywords corresponding to the characteristic vectors for the plurality of sentences, respectively, and the first voice input, and identify a sentence of the stored plurality of sentences including a keyword having a highest similarity with the first voice input as a suggested combination text of the plurality of suggested combination texts.
 5. The electronic apparatus as claimed in claim 1, wherein the server is a triple structure knowledge base server and extracts the plurality of suggested combination texts using attribute information related to the plurality of words.
 6. The electronic apparatus as claimed in claim 4, wherein the processor is further configured to, when there is no sentence including a suggested combination text having a predetermined similarity with the first voice input among the plurality of sentences, receive at least one suggested combination text corresponding to the first voice input from the server.
 7. The electronic apparatus as claimed in claim 1, wherein the processor is further configured to, based on the text information being a sentence, extract an object name from the text information using a natural language processing based algorithm and generate a question language based on the extracted object name.
 8. The electronic apparatus as claimed in claim 1, wherein the processor is further configured to control the display to display a user interface for selecting at least one suggested combination text of the plurality of suggested combination texts.
 9. The electronic apparatus as claimed in claim 1, wherein the processor is further configured to: control the communicator to receive a third voice input from an external electronic apparatus, and based on a plurality of words being included in text information corresponding to the third voice input, receive a plurality of suggested combination texts corresponding to a combination of words of the plurality of words included in text information corresponding to the third voice input through the communicator from the server.
 10. A controlling method of an electronic apparatus, the controlling method comprising: based on a plurality of words being included in text information corresponding to a first voice input, receiving a plurality of suggested combination texts corresponding to a combination of words of the plurality of words from a server; displaying the received plurality of suggested combination texts; based on a second voice input or a user input being received for selecting a suggested combination text of the plurality of suggested combination texts, receiving information corresponding to the selected suggested combination text through a communicator from the server; and displaying the received information corresponding to the selected suggested combination text.
 11. The controlling method as claimed in claim 10, wherein the displaying the plurality of suggested combination texts comprises displaying the plurality of suggested combination texts in order of a number of times in which each suggested combination text of the plurality of suggested combination texts has been previously selected based on selection history information.
 12. The controlling method as claimed in claim 10, wherein the displaying the plurality of suggested combination texts comprises displaying a predetermined number of suggested combination texts from among the plurality of suggested combination texts based on a number of times in which each suggested combination text of the plurality of suggested combination texts has been previously selected based on selection history information.
 13. The controlling method as claimed in claim 10, further comprising: comparing a similarity in a pronunciation column between pre-stored keywords corresponding to characteristic vectors for each pre-stored sentence of a plurality of pre-stored sentences and the first voice input, and identifying a pre-stored sentence of the plurality of pre-stored sentences including a keyword having a highest similarity with the first voice input as a suggested combination text of the plurality of suggested combination texts.
 14. The controlling method as claimed in claim 13, further comprising: when there is no sentence including a suggested combination text having a predetermined similarity with the first voice input among the plurality of pre-stored sentences, receiving at least one suggested combination text corresponding to the first voice input from the server.
 15. The controlling method as claimed in claim 10, further comprising: based on the text information being a sentence, extracting an object name from the text information using a natural language processing based algorithm; and generating a question language based on the extracted object name.
 16. The controlling method as claimed in claim 10, further comprising: displaying a user interface for selecting at least one suggested combination text of the plurality of suggested combination texts.
 17. The controlling method as claimed in claim 10, further comprising: receiving a third voice input from an external electronic apparatus, and based on a plurality of words being included in text information corresponding to the third voice input, receiving a plurality of suggested combination texts corresponding to a combination of words of the plurality of words included in text information corresponding to the third voice input through the communicator from the server.
 18. A non-transitory computer-readable storage medium coupled to an electronic apparatus and storing a computer program for executing operations of: based on a plurality of words being included in text information corresponding to a first voice input, receiving a plurality of suggested combination texts corresponding to a combination of words of the plurality of words from a server; displaying the received plurality of suggested combination texts; based on a second voice input or a user input being received for selecting a suggested combination text of the plurality of suggested combination texts, receiving information corresponding to the selected suggested combination text through a communicator from the server; and displaying the received information corresponding to the selected suggested combination text.
 19. The non-transitory computer-readable storage medium as claimed in claim 18, wherein the operations further include displaying a user interface for selecting at least one suggested combination text of the plurality of suggested combination texts.
 20. The non-transitory computer-readable storage medium as claimed in claim 18, wherein the operations further include: receiving a third voice input from an external electronic apparatus, and based on a plurality of words being included in text information corresponding to the third voice input, receiving a plurality of suggested combination texts corresponding to a combination of words of the plurality of words included in text information corresponding to the third voice input through the communicator from the server.